Conversation
…asr#2573) If a single phoneme is aligned to the whole utterance, it is counted as both `begin` and `end`, but is added to the total only once. This caused `assert count >= 0` in analyze_phone_length_stats.py to fail. Now only the `begin` is counted in that case.
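The fix described above can be sketched as follows (a hypothetical illustration, not the actual analyze_phone_length_stats.py code; the function name and representation are assumptions):

```python
# Sketch of the counting rule after the fix: a phone aligned to the entire
# utterance is classed only as "begin", so begin + end + internal always
# sums to the total and the derived counts stay non-negative.
def classify_phones(phones):
    """phones: ordered list of phone labels for one utterance."""
    begin_count = end_count = internal_count = 0
    for i in range(len(phones)):
        if i == 0:
            # A single phone spanning the whole utterance lands here only,
            # rather than being counted as both begin and end.
            begin_count += 1
        elif i == len(phones) - 1:
            end_count += 1
        else:
            internal_count += 1
    assert begin_count + end_count + internal_count == len(phones)
    return begin_count, end_count, internal_count
```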
…ments for OCR tasks (kaldi-asr#2579)
…aldi-asr#2596) OpenFst 1.6.7 does not build with 4.8.1, and 4.8.2 has an STL bug that is fatal for Kaldi.
Hi Dan,
ok thanks.
On Thu, Aug 9, 2018 at 6:47 PM, Gaofeng Cheng wrote:
Hi Dan,
the results did not change.
This update is a fix: in the first commit I used the wrong xconfig, and this update corrects it.
Gaofeng
…arbage if PCA failed. (kaldi-asr#2590)
conv-relu-batchnorm-layer name=cnn3 $cnn_opts height-in=40 height-out=20 height-subsample-out=2 time-offsets=-1,0,1 height-offsets=-1,0,1 num-filters-out=128
conv-relu-batchnorm-layer name=cnn4 $cnn_opts height-in=20 height-out=20 time-offsets=-1,0,1 height-offsets=-1,0,1 num-filters-out=128
conv-relu-batchnorm-layer name=cnn5 $cnn_opts height-in=20 height-out=20 time-offsets=-1,0,1 height-offsets=-1,0,1 num-filters-out=128
conv-relu-batchnorm-layer name=cnn6 $cnn_opts height-in=20 height-out=20 time-offsets=-1,0,1 height-offsets=-1,0,1 num-filters-out=128
sorry, I didn't look at this before. Can you try a version where the height-out of cnn5 and cnn6 is 10, not 20, and their num-filters-out is 256? This will leave the compute time about the same (while increasing the parameters), and will allow those layers to see a wider range of frequency. So reducing the height (and increasing the num-filters) actually increases the modeling power.
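In xconfig terms, that suggestion might look like the following (a hypothetical sketch, not lines from the PR; it assumes cnn5 takes the height subsampling via `height-subsample-out=2`, so cnn6's `height-in` drops to 10 to match):

```
# Assumed rewrite of cnn5/cnn6 per the review comment: halve height-out,
# double num-filters-out.
conv-relu-batchnorm-layer name=cnn5 $cnn_opts height-in=20 height-out=10 height-subsample-out=2 time-offsets=-1,0,1 height-offsets=-1,0,1 num-filters-out=256
conv-relu-batchnorm-layer name=cnn6 $cnn_opts height-in=10 height-out=10 time-offsets=-1,0,1 height-offsets=-1,0,1 num-filters-out=256
```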
|  | tdnn7q_sp | cnn_tdnn1a_sp | cnn_tdnn1a_more_filters_sp |
|---|---|---|---|
| WER on train_dev(tg) | 12.08 | 12.13 | 11.97 |
| WER on train_dev(fg) | 11.15 | 11.16 | 11.12 |
| WER on eval2000(tg) | 14.1 | 14.1 | 13.9 |
| WER on eval2000(fg) | 12.8 | 12.6 | 12.5 |
| WER on rt03(tg) | 17.5 | 17.3 | 17.1 |
| WER on rt03(fg) | 15.3 | 14.9 | 14.9 |
| Final train prob | -0.055 | -0.057 | -0.056 |
| Final valid prob | -0.072 | -0.075 | -0.075 |
| Final train prob (xent) | -0.875 | -0.877 | -0.871 |
| Final valid prob (xent) | -0.9064 | -0.9134 | -0.9110 |
| Num-parameters | 18725244 | 14597020 | 15187100 |
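The "compute stays about the same, parameters grow" claim can be checked with back-of-the-envelope arithmetic (a sketch under the assumption that per-layer cost is proportional to output-height positions × filters-in × kernel size × filters-out, ignoring biases in the cost and the time axis, which is unchanged):

```python
# Rough cost/parameter model for one conv layer.
# kernel = 3 time-offsets * 3 height-offsets = 9 input positions per output.
def conv_cost(height_out, filters_in, filters_out, kernel=9):
    """Multiply-adds per frame, up to a constant factor."""
    return height_out * filters_in * kernel * filters_out

def conv_params(filters_in, filters_out, kernel=9):
    """Weights plus biases."""
    return filters_in * kernel * filters_out + filters_out

# cnn5 as in the diff: height-out=20, 128 -> 128 filters.
# cnn5 as suggested:   height-out=10, 128 -> 256 filters.
cost_ratio = conv_cost(10, 128, 256) / conv_cost(20, 128, 128)
param_ratio = conv_params(128, 256) / conv_params(128, 128)
print(cost_ratio)   # 1.0  -- same compute for cnn5
print(param_ratio)  # 2.0  -- twice the parameters
```

For cnn6 the input filter count doubles as well, so its cost does grow; the overall point, that halving the height roughly offsets doubling the filters, still holds.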
Great! So that's the setup with more filters and height-out=10 on the last 2 layers, then?
In that case I think you should just change your 1a to be that configuration, and we could merge that.
…ere is no spk info (kaldi-asr#2639)
…asr#2581) This came from Vimal's work on the MGB-3 challenge. Interface is similar to the existing GMM-based cleanup/segmentation scripts.
…_tdnn_f_cgf_local
No description provided.